# Whisper fine-tuning
Kinyawhisper
MIT
KinyaWhisper is a fine-tuned Kinyarwanda automatic speech recognition (ASR) system based on OpenAI's Whisper model, specifically designed for low-resource indigenous languages.
Speech Recognition
Transformers Other

benax-rw
149
3
Whisper Small Ta
Apache-2.0
A Tamil speech recognition model based on OpenAI's Whisper Small and fine-tuned on the Tamil portion of Common Voice 17.0, with a word error rate (WER) of 43.23%.
Speech Recognition
Transformers Other

navin-kumar-j
38
1
Indian Accent English Whisper Finetuned
MIT
A fine-tune of openai/whisper-large-v3-turbo on an Indian English accent dataset, making it better suited to recognizing Indian-accented English speech.
Speech Recognition
Transformers English

Tejveer12
1,733
1
Quran Whisper Base Fine Tune
Apache-2.0
An Arabic speech recognition model based on openai/whisper-base and fine-tuned on the quran-ayat-speech-to-text dataset, specializing in transcribing recited Quranic verses to text.
Speech Recognition
Transformers Arabic

Baselhany
35
1
Whisper Base Pl
Apache-2.0
A Polish speech recognition model based on OpenAI Whisper-base and fine-tuned on the Polish portion of Common Voice 17.0.
Speech Recognition
Transformers Other

marcsixtysix
27
1
Viwhisper Medium
MIT
Whisper-medium model optimized for Vietnamese speech recognition tasks, fine-tuned on 1308 hours of Vietnamese data
Speech Recognition
Transformers Other

NhutP
139
4
Whisper Large V3 Cantonese
Apache-2.0
A Cantonese automatic speech recognition model fine-tuned from Whisper large-v3 on the Common Voice 17 dataset.
Speech Recognition
Transformers Other

khleeloo
25
4
Akan Whisper Model
A fine-tuned version of OpenAI's Whisper model, specifically designed for automatic speech recognition tasks in the low-resource Ghanaian language Akan
Speech Recognition
Transformers Other

GiftMark
354
3
Whisper Small Khmer
MIT
A speech recognition model fine-tuned from openai/whisper-small and optimized for Khmer transcription accuracy.
Speech Recognition
Transformers Other

Vira21
15
1
Whisper Tiny Myanmar
Apache-2.0
A Burmese automatic speech recognition (ASR) model based on openai/whisper-tiny and fine-tuned on Burmese speech datasets for speech-to-text tasks.
Speech Recognition
Transformers Other

chuuhtetnaing
84
1
Whisper Large V3 Myanmar
Apache-2.0
A Burmese automatic speech recognition model based on openai/whisper-large-v3 and fine-tuned on a Burmese speech dataset for Burmese speech transcription.
Speech Recognition
Transformers Other

chuuhtetnaing
172
1
Monsoon Whisper Medium Gigaspeech2
Apache-2.0
Monsoon-Whisper-Medium-GigaSpeech2 is a Thai automatic speech recognition (ASR) model, based on Whisper-Medium and fine-tuned on the GigaSpeech2 dataset, suitable for speech recognition in real-world scenarios.
Speech Recognition
Transformers

scb10x
546
5
Akylai STT Small
Apache-2.0
Kyrgyz Whisper ASR is a customized automatic speech recognition solution for the Kyrgyz language, fine-tuned from a pre-trained Whisper model.
Speech Recognition
Transformers Other

the-cramer-project
73
1
Whisper Large V3 Taiwanese Hakka
A Whisper-large-v3 fine-tuned model for Taiwanese Hakka speech recognition, supporting multiple Hakka dialects
Speech Recognition
Transformers Other

formospeech
41
5
Detect Language
Apache-2.0
A language identification model fine-tuned from Whisper Medium for language classification on the FLEURS dataset.
Audio Classification
Transformers

apparaomulpuriril
15
0
Whisper Sinhala Audio To Text
Apache-2.0
A Sinhala speech recognition model fine-tuned from openai/whisper-small that converts Sinhala speech to text.
Speech Recognition
Transformers

AqeelShafy7
229
2
Whisper Small Kyrgyz
Kyrgyz automatic speech recognition (ASR) model based on the Whisper architecture, developed with support from the National Commission on Language and Language Policy under the President of the Kyrgyz Republic
Speech Recognition
Transformers Other

UlutSoftLLC
841
4
Whisper Tiny Vi
Apache-2.0
A Vietnamese automatic speech recognition (ASR) model fine-tuned from OpenAI Whisper-tiny, described as performing well on multiple Vietnamese datasets.
Speech Recognition
Transformers Other

doof-ferb
44
2
Phowhisper Medium
BSD-3-Clause
PhoWhisper is a series of models for Vietnamese automatic speech recognition (ASR). It achieves high robustness by fine-tuning the Whisper model on an 844-hour dataset covering diverse Vietnamese accents.
Speech Recognition
Transformers Other

vinai
2,999
10
Phowhisper Small
BSD-3-Clause
PhoWhisper is a system for Vietnamese automatic speech recognition, fine-tuned from the Whisper model and supporting a variety of Vietnamese accents.
Speech Recognition
Transformers Other

vinai
2,725
8
Phowhisper Large
BSD-3-Clause
PhoWhisper is a system for Vietnamese automatic speech recognition, fine-tuned from the Whisper model and supporting a variety of Vietnamese accents.
Speech Recognition
Transformers Other

vinai
2,373
28
Whisper Small Fa
A Whisper (small) model fine-tuned by the Hezar team on the Persian portion of the Common Voice dataset for automatic speech recognition.
Speech Recognition Other
hezarai
363
11
Whisper Large V2 Spanish
Apache-2.0
A Spanish speech recognition model based on OpenAI Whisper-large-v2 and fine-tuned on the Common Voice 13.0 Spanish dataset.
Speech Recognition
Transformers

Sandiago21
38
3
Asr Whisper Medium Commonvoice Fa
Apache-2.0
A Whisper medium model fine-tuned on the CommonVoice-14.0 Persian dataset for Persian automatic speech recognition.
Speech Recognition Other
speechbrain
21
3
Banglaasr
MIT
A Bengali automatic speech recognition model based on the Whisper small architecture, fine-tuned on approximately 400 hours of Mozilla Common Voice data, with a word error rate of 4.58%.
Speech Recognition
Transformers

bangla-speech-processing
782
15
Afrispeech Large A100
An African language speech recognition model fine-tuned from Whisper-large-v2, trained on the afrispeech-200 dataset with a word error rate (WER) of 14.81
Speech Recognition
Transformers

Seyfelislem
20
1
Whisper Medium Arabic
Apache-2.0
An Arabic speech recognition model fine-tuned from openai/whisper-medium, with support for streaming processing.
Speech Recognition
Transformers

Seyfelislem
1,832
5
Whisper Large V2 Spanish
Apache-2.0
A Spanish speech recognition model fine-tuned from openai/whisper-large-v2, achieving 8.55% WER on the Common Voice 11.0 Spanish test set.
Speech Recognition
Transformers

clu-ling
85
2
Whisper Large V2 Kazakh
Apache-2.0
A Kazakh speech recognition model based on OpenAI's Whisper Large V2 and fine-tuned on the Kazakh portion of Common Voice 11.0.
Speech Recognition
Transformers Other

DrishtiSharma
40
3
Whisper Medium Portuguese
Apache-2.0
A Portuguese speech recognition model based on openai/whisper-medium and fine-tuned on the common_voice_11_0 dataset, with a word error rate of 6.5987.
Speech Recognition
Transformers Other

pierreguillou
191
28